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Abstract — In this paper, a channel that is contaminated by two 
independent Gaussian noises S ~ Af(0, Q) and Zo ~ J\T(0,No) 
is considered. The capacity of this channel is computed when 
independent noisy versions of S are known to the transmitter 
and/or receiver. It is shown that the channel capacity is greater 
then the capacity when S is completely unknown, but is less 
then the capacity when S is perfectly known at the transmitter or 
receiver. For example, if there is one noisy version of S known at 
the transmitter only, the capacity is \ log(l+ q(n^/(q+n 1 ))+n )^ 
where P is the input power constraint and Ni is the power of 
the noise corrupting S. Further, it is shown that the capacity 
with knowledge of any independent noisy versions of S at the 
transmitter is equal to the capacity with knowledge of the 
statistically equivalent noisy versions of S at the receiver. 

Index Terms — Dirty paper coding, achievable rate, interfer- 
ence mitigation, Gaussian channels. 

I. INTRODUCTION 

Consider a channel in which the received signal, Y is 
corrupted by two independent additive white Gaussian noise 
(AWGN) sequences, S - W(0,QI„) and Z - Af(0,N o I n ), 
where I„ is the identity matrix of size n. The received signal 
is of the form, 

Y = X + S + Z , (1) 

where X is the transmitted sequence for n uses of the 
channel. Let the transmitter and receiver each has knowledge 
of independent noisy observations of one of the noises, S. We 
quantify the benefit of this additional knowledge by computing 
the capacity of the channel in (HJ and presenting the coding 
scheme that achieves capacity. Our result indicates that the 
capacity is of the form | log(l + ). where < [i < 1 

is the residual fraction (explicitly characterized in Section Hill 
of the interference power, Q, that can not be canceled with 
the noisy observations at the transmitter and receiver. 

One special case of our model is when a noisy version of 
S is known only to the transmitter; our result in this case is 
a generalization of Costa's celebrated result [1]. In [1], it is 
shown that the achievable rate when the noise S is perfectly 
known at the transmitter is equivalent to the rate when S 
is known at the receiver, and this rate does not depend on 
the variance of S. A new coding strategy to achieve this 
capacity was also introduced in [1] and is popularly referred 
to as dirty paper coding (DPC). We generalize Costa's result 
to the case of noisy interference knowledge. We show that 
the capacity with knowledge of a noisy version of S at 
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the transmitter is equal to the capacity with knowledge of 
a statistically equivalent noisy version of S at the receiver. 
However, unlike [1] where the capacity does not depend on 
the variance of S, in the general noisy side information case, 
the capacity decreases as the variance of S increases. 

We also compute the capacity when multiple independent 
noisy observations of S are available at the transmitter and 
receiver. We show that in this case, it is sufficient for the 
transmitter and receiver to each compute maximum-likelihood 
estimates of S based on their observations and then use 
these estimates to achieve capacity using the coding strategy 
proposed in Section ITl-CI Further, it is shown that the capacity 
of a Gaussian channel with multiple independent observations 
of S known at the transmitter is equal to the capacity with 
statistically similar observations known at the receiver. 

The proposed model can have several potential applications. 
For instance, consider the following scenario. Node A is 
transmitting to node B, but whose signal is also received at 
nodes C and D who are communicating with each other. Thus, 
nodes C and D can use the noisy estimate of user A's signal 
to improve the rate at which they communicate. 

In [1], Costa adopted the random coding argument given by 
Gel'fand and Pinsker [2] and El Gamal and Heegard [3]. In 
[2], the capacity of a discrete memoryless channel with side 
information at the encoder is derived and the result has been 
extended to the case when each of the encoder and decoder 
has one of two correlated side information [4]. Based on the 
channel capacity C — Tn&yL p ( ux \ s ){I(U\Y) — I(U,S)} given 
in [2], [3], Costa constructed the auxiliary variable U as a 
linear combination of X — 7V(0, P) and S — 7V(0, Q) and 
showed that this simple construction of U achieves capacity. In 
our proof for achievability, we also follow the arguments from 
[2], [3]. Further, similar to [1], the optimal auxiliary variable 
in our case is also a linear combination of X and the noisy 
observations of S at the encoder. Thus, our coding scheme 
can be viewed as a variation of DPC. 

Following the pioneering work of Costa, several extensions 
of DPC have been studied, e.g., colored Gaussian noise [5], 
arbitrary distributions of S [6] and deterministic sequences [7]. 
In [8], DPC has been applied to the AWGN and jitter channel 
and the rate loss due to imperfect synchronization at the 
decoder has been evaluated. DPC has also been extensively 
applied to watermarking or information embedding [9], [10] 
applications and for the Gaussian broadcast channel [11], [12]. 
In [13] the authors consider the case when S is perfectly 
known to the encoder and a noisy version is known to the 
decoder. Most of the analysis in [13] is for discrete memory- 
less channels and also causal knowledge of interference. The 
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only result in [13] for Gaussian channel shows that there is no 
additional gain due to the presence of the noisy estimate at the 
decoder, since perfect knowledge is available at the encoder 
and a DPC can be constructed. In contrast, in this paper we 
study the case when only noisy knowledge of S is available 
at both transmitter and receiver. 

Throughout this paper we only consider the case of non- 
causal knowledge of the interference. Causal extensions to 
these should be studied in future work. The rest of this paper is 
organized as follows: Section El introduces the system model 
and also gives the main capacity result. Extensions of the result 
to single and multiple noisy observations are evaluated in 
Section Hn] Brief concluding remarks are given in Section HVl 

II. SYSTEM MODEL, CAPACITY AND 
ACHIEVABILITY 

In this section, we first introduce the channel model and 
then compute the capacity of this channel. 



capacity of this channel equals ~ log ^1 



where 



Remark: It is clear that \i = when either iVi = or 
N2 = 0. Consequently, the capacity is j log ^1 + jj^j, which 
is consistent with the result in [ 1 Further, /1 = 1 when Ni 
00 and N2 — > 00, and the capacity is | log ( 1 1 " 
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which is the capacity of a Gaussian channel with noise 
variance Q + Nq. Thus, one can interpret p, as the residual 
fractional power of the interference that cannot be canceled 
by the noisy observations at the transmitter and receiver. 

Proof: We first compute an outer bound on the capacity of 
this channel. It is clear that the channel capacity can not exceed 
m&x p ( x \ mim ^ I(X;Y\Mi, M2), which is the capacity when 
both Mi and M 2 are known at the transmitter and receiver. 
Thus, a capacity bound of the channel can be calculated as 



A. Channel Model 

Z^M{0,n,) S~JV(0,Q) 



W 



I(X; Y\M U M 2 ) = I(X; Y, M U M 2 ) - I(X; M X ,M 2 ) 
< I(X;Y,M 1 ,M 2 ) 

= H(X) + H(Y, Mi, M 2 ) - H(X, Y, M 1; M 2 ) 
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Fig. 1. Channel model. 



where fi 



HQ + No 



N 2 



(3) 

Note that the inequality in (0 is 



The channel model is depicted in Fig.Q] The communication 
problem is to send an index, W £ {1, 2, . . . , K }, to the 
receiver in n uses of the channel. The transmission rate is 
R = — log 2 K bits per transmission. The output of the channel 
is given in (Q]i and is contaminated by two independent AWGN 
sequences, S - 7V(0,QI„) and Z - Af(0,N I n ). Side 
information Mi = S + Zi, which is noisy observations of 
the interference is assumed to be available at the transmitter 
for all n uses of the channel. Similarly, noisy side information 
M2 = S + Z 2 , is assumed to be available at the receiver for 
all n uses of the channel. The noise vectors are distributed as 
Zi - AA(0, iViI„) and Z 2 - Af(0, N 2 I n ). 

Based on the index W and the observation Mi, the encoder 
transmits one codeword, X, from a (2 nR , n) code book. The 
codeword X must satisfy an average power constraint of the 
form, ^-||X|| 2 < P. Let W be the estimate at the receiver of 
the transmitted index; an error occurs if W =^ W. 

B. Channel Capacity 

Theorem 1: Consider a channel of the form (fTJ with an 
average transmit power constraint P. Let independent noisy 
observations M x = S + Z x and M 2 = S + Z 2 of the 
interference S be available, respectively, at the transmitter and 
receiver. The noise vectors have the following distributions: 
Z 4 - Af{0,N z I n ), i = 0,1,2 and S - A/"(0,QI„). The 



actually a strict equality since I(X; Mi, M 2 ) = 0. 

C. Achievability of Capacity 

We now prove that the capacity computed in (O is achiev- 
able. The codebook generation and encoding method we use 
follows the principles introduced in [2], [3]. The construction 
of auxiliary variable is similar to [1]. 

Random codebook generation: 

1) Generate 2 n/ ( c/;y - M2 ' i.i.d. length-n codewords U, 
whose elements are drawn i.i.d. according to U ~ 
Af(0, P + a 2 (Q + Ni)), where a is a coefficient to be 
optimized. 

2) Randomly place the 2 nI{ - u - Y ' M ^ codewords U into 2 nR 
cells in such a way that each of the cells has the 
same number of codewords. The codewords and their 
assignments to the 2 nR cells are revealed to both the 
transmitter and the receiver. 

Encoding: 

1) Given an index W and an observation, Mi = M x (i), 
of the Gaussian noise sequence, S, the encoder searches 
among all the codewords U in the W th cell to find 
a codeword that is jointly typical with Mi(i). Ac- 
cording to the joint asymptotic equipartition property 

'Costa's result is a special case with Ni = and N2 = 00. 
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(AEP) [14], it is easy to show that if the number 
of codewords in each cell is larger than or equal to 
2 nI ( U ' Ml \ the probability of finding such a codeword 
U = U(i) exponentially approaches 1 as n — > oo. 

2) Once a jointly typical pair (U(i), Mi(i)) is found, the 
encoder calculates the codeword to be transmitted as 
X(i) = U(i)-aMi(i). With high probability, X(i) will 
be a typical sequence which satisfies i||X(i)|| 2 < P. 

Decoding: 

1) Given X(i) is transmitted, the received signal is Y(i) = 
X(i) + S + Zo. The decoder searches among all 
2ni(U;Y,M 2 ) codewords U for a sequence that is jointly 
typical with Y(i). By joint AEP, the decoder will 
find U(i) as the only jointly typical codeword with 
probability approaching 1. 

2) Based on the knowledge of the codeword assignment to 
the cells, the decoder estimates W as the index of the 
cell that U(i) belongs to. 

Proof of achiev ability: 

Let U = X + aMi = X + a(S + Z x ), Y = X + S + Z a 
and M 2 = S + Z 2 , where X ~ 7V(0, P), S ~ Af(0, Q) and 
Zi ~ jV(0, Ni), i — 0, 1, 2 are independent Gaussian random 
variables. To ensure that with high probability, in each of the 
2 nR cells, at least one jointly typical pair of U and Mi can 
be found. The rate, R, which is a function of a, must satisfy 

R(a) < I(U; Y, M 2 ) - I(U; Mi). (4) 

The two mutual informations in can be calculated respec- 
tively as 

I(U;Y,M 2 ) 
= H{U) + H(Y, M 2 ) - H(U, Y, M 2 ) 

= ilo g 2^e [P + c?{Q + Ni)] 
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Substituting © and © into (@), we find 
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(5) 
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We can now find the optimal coefficient a* that maximizes 
the right hand side of using the extreme value theorem. 
After simple algebraic manipulations, the optimal coefficient 
is computed as 



QPN 2 



(8) 



Q(P + N )(N 1 + N 2 ) + (Q + P + N )N 1 N 2 
Substituting for a* into ©, the maximal rate is found to be 



with /i = — q-, which is exactly the upper bound ( 

capacity yjl. 

III. SPECIALIZATION AND GENERALIZATION 



(9) 



The coding scheme to achieve capacity in Section Hl-CI can 
be easily specialized to the case when a single observation 
is available at either the encoder or decoder. Further, it can 
be generalized to the scenario when multiple independent 
observations are available at the encoder and decoder. 



A. Single Observation 



S~Af(0,Q) 



W 




Z ~A<-(0,N ) 

Fig. 2. Channel model with an observation of S at the receiver. 

1 ) Noisy estimate at receiver only: Fig.|2]shows the channel 
model when the observation of S is only available at the 
receiver. This channel is equivalent to our original model when 
N\ — > co. The capacity of the channel is given by 



J(X ; y,M 2 ) = ilog(l ; J' 
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Fig. 3. Channel model with an observation of S at transmitter. 

2) Noisy estimate at transmitter only: Generalization of 
Dirty Paper Coding: Fig. [3] shows the channel model when 
the observation of S is only available at the transmitter. This 
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is a special case of our original model with N 2 — * 00. The 
capacity of the channel is 



/(x ; y|M 1 ) = -io g 1 



^ ( Q+JVi ) 



No 



(11) 



The achievability can be easily shown following the same steps 
as in Section HI-Cl with N 2 — » 00. Note that when Ni = 0, the 
channel model further reduces to Costa's dirty paper coding 
channel model [1]. 
Remarks: 

1) In [1], Costa showed the surprising fact that the channel 
capacity with additive Gaussian interference known to the 
transmitter only is the same as the channel capacity with the 
interference known to the receiver only. This paper extends 
that result to the case of noisy interference. Indeed, by setting 
N\ = N 2 in ([Tol l and (fTTl i. we can see that the capacity 
with noisy interference known to transmitter only equals the 
capacity with a statistically similar noisy interference known 
to receiver only. 

2) From (fTTT i. one may intuitively interpret the effect of 
knowledge of M\ at the transmitter. Indeed, a fraction q+NI 
of the interfering power can be canceled using the proposed 
coding scheme. The remaining q+ 1 Ni fraction of the inter- 
fering power, Q, is treated as 'residual' noise. Thus, unlike 
Costa's result [1], the capacity in this case depends on the 
power Q of the interfering source: For a fixed N±, as Q 
increases, the capacity decreases. The capacity approaches 

5 lo § i 1 + n^ne) with Q °°- 



B. Multiple Independent Observations 
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(12) 

1 N ni J r n 2 



where fi = 

are the variances of the Gaussian noise variables, 
Zi, Z 2 , ■ ■ ■ , Z ni+1l2 , corresponding to the + n 2 

observations. 

Achievability of capacity: To show the achievability, the 
coding process follows the same random coding argument 
as in Section IH-CI The main difference is that in this case, 
we first construct one estimate of the interference at the 
transmitter (similarly at the receiver) based on the multiple 
noisy observations and then use this estimate in the coding 
process. Thus, we omit the detailed development and show 
only the main steps. 

At the transmitter, upon receipt of the m independent obser- 
vations, the encoder makes a maximum likelihood estimation 
(MLE) of S, which is given as 



Si = arg max/s (Mi,M 2 , 



.,M ni \S) 



(13) 



where 



(Mi - sy 

2N, 



fs(Mi,M 2 , . . . , M ni \S) = [J -== exp 

[Li V 2lr Ni 

(14) 

Taking logarithm on both side of ( fT4b and differentiating with 
respect to S, the estimate, §1, which maximizes fs is found 
to be 



Fig. 4. Channel model with multiple independent observations of S. 



Let there be ni independent observations Mi, M2, . . . , 
M ni of S at the transmitter and n 2 independent observations 
M„ 1+ i, M„ 1+ 2, . . . , M„ 1+ „ 2 at the receiver, as shown in 
Fig-El Similar to Section ITl-BI an upper bound on the capacity 
of this channel can be computed as 



I(X;Y\Mi,M 2i . 
< I(X;Y,Mi,M 2 ,. 
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,M, 
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H(X) + H(Y, Mi,M 2 , M ni+n2 ) 
-H(X,Y,M 1 ,M 2 ,...,M ni+7l2 ) 
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Em Nj_ 
fc=l N k 



(15) 



Thus, the variance of the estimation error at the transmitter 
can be computed as 



Ni = Var 



1 



k=l N k ) 1^1=1 N, 



(16) 



Similarly, at the receiver, the decoder also computes the MLE, 
^2 of S based on the n 2 independent observations. The 
variance of the estimation error at the receiver is given by 

N 2 = "t~ . (17) 



En 
1= 
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Thus, using MLE, the multiple observations is equivalent to 
one observation each at the transmitter and receiver with esti- 
mation noise variance Ni and N~2- Essentially, the construction 
of the auxiliary variable U is similar to that in Section III-CI 
using Ni and N2, i.e., U — X + aS\. The achievable rate, 
R*, can then be found by substituting N% and N2 instead of 
Ni and N2, respectively, in (0 and is given by, 



R* 
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log 1 
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Clearly, R* 



where ft = — 

is the same as the upper bound on capacity given in (fT2l . 

Remark: It is easy to see that the capacity expression 
is symmetric in the noise variances at the transmitter and 
receiver. In other words, having all the n\ + ri2 observations 
at the transmitter would result in the same capacity. Thus, 
the observations of S made at the transmitter and the receiver 
are equivalent in achievable rate, as long as the corrupting 
Gaussian noises have the same statistics. Further, if there 
is an extra independent observation of S with corrupting 
noise variance 7V„ 1+ „ 2+ i, the fraction ft in (fT8l decreases 

i . Thus, the irremovable 
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part of the noise power Q decreases and the achievable rate 
increases. It is clear that it does not matter whether this extra 
observation is obtained at the transmitter or the receiver. 
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IV. CONCLUSION 

In this paper, we derived the capacity region for a Gaussian 
noise channel with additive Gaussian interference, when noisy 
estimates of the interference are known at the transmitter and 
receiver. Our results indicate that knowledge of the interfer- 
ence at the transmitter gives the same capacity as knowledge 
of statistically similar interference at the receiver. As noted 
earlier, all results in this paper are derived assuming non-causal 
knowledge of the noisy interference. Studying capacity with 
causal knowledge of the interference should be investigated in 
future work. 
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